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Decorrelation of Signals 

This invention relates to decorrelation of signals; more particularly it relates to 
decorrelation of signals that have undergone convolutive mixing, and to a method, an 
apparatus and a computer program for implementing decorrelation. 

5 Decorrelation is known: it is a form of signal processing implemented by software running 
on a computer system and accepting data from sensors after conversion from analogue to 
digital form. Two assumptions are normally made in decorrelation, stationarity and 
linearity, and these assumptions are also made in connection with the present invention. 
Stationarity means that signals and channels in which they mix do not change over a time 

10 interval during which mixed signals are sampled. Linearity means that mixtures of signals 
received by sensors are linear combinations of these signals. More complicated 
combinations featuring signal products and squares and higher order powers of signals 
are not considered. 

The aim of decorrelation is to remove similarities between input signals, thereby 
15 eliminating redundancies. In some circumstances this may lead to recovering the signals 
as they were prior to mixing, but more usually it is used as a pre-processing step in an 
unmixing algorithm, or as part of another algorithm such as a data compression algorithm. 
A common usage is to separate the signals which carry meaningful data from those that 
arise due entirely to noise in the environment and in a receiver system which detects the 
20 signals: such separation may be an end in itself, or it may be used as part of a data 
compression scheme. 

Decorrelation is a process of removing correlations or similarities between signal pairs in a 
set of signals: correlation of a pair of signals is itself defined mathematically as an integral 
of the signal pair's product over time. It is important that as little information as possible 
25 (preferably none) is destroyed in decorrelation process, as would occur for example if 
decorrelation led to one of the signals being zeroed. Use of energy preserving transforms 
for decorrelation avoids information being destroyed and leads to output signals with other 
useful properties. 

30 If an energy preserving transform is used, it is usually required to recover decorrelated 
signals in decreasing order of power. This allows data compression techniques and 
signal/noise subspace separation techniques to be implemented very simply. 
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Algorithms for decorrelating signals that have been instantaneously mixed are known: they 
may use either transforms that are energy preserving or those that are not. 

Unfortunately* an algorithm which is adequate for decorrelating signals that have been 
5 instantaneously mixed cannot cope with more difficult problems. These more difficult 
problems occur when an output signal from a sensor must be expressed mathematically 
as a convolution, i.e. a combination of a series of replicas of a signal relatively delayed 
with respect to one another. It is therefore referred to as the "convolutive mixing" problem. 

The approach used in instantaneous algorithms has been extended to the convolutive 
10 mixing situation: from this approach it has been inferred that convolutively mixed signals 
can be decorrelated by the use of an energy preserving technique. This algorithm would 
accommodate time delays involved in mixing and decorrelation. The criterion to be 
imposed has however changed: i.e. pointwise decorrelation (as defined later) is no longer 
sufficient, instead a wider condition involving decorrelation with time delays should be 
15 imposed. This will be referred to as wide sense decorrelation, and imposing this property 
as imposing strong decorrelation. 

• Instead of the unitary matrix employed in instantaneous algorithms, an algorithm for 
imposing strong decorrelation by an energy preserving filter employs a paraunitary matrix. 
As will be described later in more detail, a paraunitary matrix is one which gives the 
20 identity matrix when multiplied by its paraconjugate matrix - a polynomial equivalent of a 
Hermitian conjugate matrix. A possible approach for the strong decorrelation problem is 
therefore to search for a paraunitary matrix that maximises a measure of the wide sense 
decorrelation. 

In this case, it may be possible to improve on the mere return of signals in decreasing 
25 order of power. It may be possible to provide for return of signals in decreasing order of 
power at each frequency in a set of frequencies. Thus a first signal has more power than 
other signals at all frequencies and a second signal has more power than all other signals 
apart from the first at all frequencies etc. This property is called spectral majorisation. 
Spectral majorisation is useful if the algorithm is to be used for data compression or for 
30 separating signal and noise subspaces. 
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In "Theory of Optimal Orthonormal Filter Banks" ICASSP 1996, P.P.Vaidyanathan 
discloses a use for filtering techniques where both strong decorrelation and spectral 
majorisation are necessary, neither being sufficient without the other. 

The simplest prior art method of imposing strong decorrelation involves using a 
5 multichannel whitening lattice filter. See for example S. Haykin, "Adaptive Filter Theory, 
Prentice Hall, 1991, although there are many references for these techniques, based on 
slightly different algorithms designed with different convergence and stability properties in 
mind. These techniques were not designed to impose strong decorrelation directly: instead 
they aim to recover an innovations sequence of the mixed signals. The components of the 
10 innovations sequence so produced are strongly decorrelated as a consequence of what 
they are. However these techniques are not energy preserving, and so are not suitable for 
use in many scenarios. 

In " Principal Component Filter Banks for Optimal Multiresolution Analysis, IEEE 
transactions on Signal Processing Vol 43 August 1995, M.K. Tsatsanis and G.B. 

15 Giannakis discuss how to find an optimal energy preserving and spectral majorising filter. 
This reference uses a frequency domain technique to show how an optimal filter can be 
found if infinite information is held about the signals and no constraint is placed upon the 
order of the filter. However it does not show either how this can be transferred to a data 
dependent algorithm, or how the filter can be constrained to a sensible order without 

20 losing optimality. However its calculations provide absolute performance bounds, and 
formed the basis of several of algorithms discussed below. 

In "Multirate Systems and Filter Banks", Prentice Hall: Signal Processing Series, 1993, 
P.P. Vaidyanathan discloses parameterisation of paraunitary matrices in a stage-by-stage 
decomposition of a paraunitary matrix in z A \ here z A is an operator implementing a 

25 delay. Vaidyanathan shows that a product matrix built up from a series of pairs of 
paraunitary matrix blocks is paraunitary: here, in a pair of blocks, one block represents a 
delay and the other a 2 by 2 unitary matrix implementing a Givens rotation (see US Pat No 
4,727,503). Vaidyanathan- also shows that a paraunitary matrix of degree N, defined later, 
is a product of N+1 rotations and N one-channel delay operators all implementing the 

30 same unit delay. 

In "Attainable Error Bounds in Multirate Adaptive Lossless FIR Fitters? , ICASSP 1 995, P.A. 
Regalia and D Huang propose two different eigenvalue-based algorithms for strong 
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decorrelation. Their algorithms are recursive, limited to two input signals and based upon 
a fixed degree lossless filter with parameters disclosed by Vaidyanathan. It is also 
necessary to access data from various points internal to the Vaidyanathan decomposition 
for use in adjusting the parameters: this adds an extra layer of processing complexity. 

5 In u Rational Subspace Estimation using Adaptive Lossless Filter^ IEEE transactions on 
Signal Processing Vol 40 Oct1992 P.A. Regalia and P. Loubatton disclose a different 
algorithm for attempting to impose both strong decorrelation and spectral majorisation. 
This algorithm is applicable to more than two signals,- but it is based upon a deflationary 
approach where an optimal single signal output is first calculated, and then augmented 

10 with other outputs until one of the latter becomes insignificant. Again a fixed degree and 
fixed order lossless filter is parameterised as disclosed by Vaidyanathan, and the 
parameters are adjusted according to a stochastic algorithm, which requires information 
on the data flow inside the Vaidyanathan decomposition. 

A different algorithm is proposed in "Design of Signal-Adapted FIR Paraunitary Filter 
15 Bank? ICASSP 1996 by P. Moulin, M. Anitescu, K.O. Kortanek and F. Potra. This uses a 
slightly different version of the Vaidyanathan parameterisation with a fixed degree. From 
the effects of a parameterised filter on one single output channel, this algorithm shows 
how to convert a problem of maximising power in one channel into a linear semi-infinite 
programming problem, which has known methods for solution. In effect the algorithm has 
20 an objective of achieving both strong decorrelation and spectral majorisation; it has a cost 
function which is very similar to a known N° cost function to be described in more detail 
later. However it forces the paraunitary filter to be of a fixed degree, and relies upon a link 
between parameters and cost function being computationally tractable. For different cost 
functions this would not be the case. 

25 Gradient descent methods are known which aim to adjust all parameters of a paraunitary 
matrix or an unmixing filter simultaneously: these methods have another difficulty, i.e. they 
have parameters linked to any useful measure of independence in a very complex way 
which does not factorise easily. This means adjusting all parameters at once: it leads to 
very slow algorithm convergence, which is why none of the algorithms previously 

30 mentioned use gradient descent. Gradient descent tends to be avoided in the prior art due 
to problems with speed and accuracy of convergence. 
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It is an object of the invention- to provide an alternative method of strongly decorrelating 
signals. 

The present invention provides a method for strong decorrelation of signals which are 
input, characterised in that it includes the steps of:- 

a) processing the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the input 
signals into output signals to obtain improvement in a measure of strong 
decorrelation; 

b) assessing the improvement in the measure of strong decorrelation, and if it is 
significant designating the output signals as input signals and iterating step a) and 
this step b); 

c) if the improvement is not significant designating the output signals as signals 
decorrelated in a wide sense. 

The invention provides a technique for wide sense decorrelation of signals which have 
been convolutively mixed, even if mixed using unknown filters. It is suitable, for example, 
for signals which have undergone multipath reflection, or acoustic signals in a reverberant 
environment. It is believed that the method of the invention is more reliable and requires 
less calculation than any alternative currently known. By virtue of use of an elementary 
paraunitary matrix, decorrelation is carried out using an energy preserving transform, and 
examples of the invention may (not necessarily but if required) impose spectral 
majorisation on the signals as well as imposing wide sense decorrelation. 

To transform the input signals, the method of the invention may determine in step a) delay 
and rotation parameters which characterise a single elementary paraunitary matrix. It may 
include producing a paraunitary matrix by cumulatively multiplying successive elementary 
25 paraunitary matrices produced by iterating step a). 

The range of signal delay parameters may be a set of discrete delay vectors, and the 
delay and rotation parameters may be determined by generating a respective version of 
the input signals delayed by each delay vector in the set, and for each version finding 
rotation parameters which at least approach producing maximisation of the output signals' 
30 strong decorrelation. The rotation parameters which at least approach producing 
maximisation of the output signals' strong decorrelation may be determined using an 
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algorithm for instantaneous decorrelation. The at least one elementary paraunitary matrix 
may implement at least one leading delay, rotation and terminal delay. 

The method may involve n input signals where n is an integer greater than 2, the range of 
signal delay parameters being a set of n-element delay vectors and the range of signal 

5 rotation parameters being a set of n(n- 1)/2 angle parameters. Step a) may comprise 
determining delay and rotation parameters which implement at least one elementary 
paraunitary matrix providing for rotation of a pair of input signals and relative delay of the 
or as the case may be each other input signal. The n input signals may be associated with 
respective channels, step a) having n(n - 1 )/2 successive stages each associated with at 

10 least one respective elementary paraunitary matrix and each providing for rotation of 
signals associated with a respective pair of channels and provision of relative delay 
associated with the or as the case may be each other channel, the first stage being 
arranged to process the input signals and the or as the case may be each subsequent 
stage being arranged to receive signals processed in the respective preceding stage. 



15 The method of the invention may involve a set of n input signals where n is an integer 
greater than 2, characterised in that it comprises: 

a) producing n(n - 1 )/2 replicas of the set of input signals, 

b) in each replica selecting a respective signal pair differing to that selected in other 
replicas, and 

20 c) the step of processing the input signals to determine delay and rotation parameters 
being carried out for each replica and comprising 

i) determining delay and rotation parameters which implement at least one 
elementary paraunitary matrix providing for rotation of the respective selected 
signal pair only, and 

25 ii) determining which replica when transformed by the associated at least one 

elementary paraunitary matrix gives rise to transformed signals corresponding 
to improvement in a measure of strong decorrelation by at least a major part of 
a maximum extent obtainable over the replicas and designating these 
transformed signals as output signals. 



30 



In another aspect, the present invention provides computer apparatus for strong 
decorrelation of signals, the apparatus being programmed for reception of input signals, 
characterised in that the apparatus is also programmed:- 
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a) to process the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the input signals 
into output signals to obtain improvement in a measure of strong decorrelation; 

b) to assess the improvement in the measure of strong decorrelation, and if it is 
5 significant designate the output signals as input signals and iterating a) and b); 

c) if the improvement is not significant to designate the output signals as signals 
decorrelated in a wide sense. 

In a further aspect, the present invention provides a computer programme for 
implementing strong decorrelation of signals input to computer apparatus, characterised in 
10 that it has instructions for controlling computer apparatus:- 

d) to process the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the input signals 
into output signals to obtain improvement in a measure of strong decorrelation; 

e) to assess the improvement in the measure of strong decorrelation, and if it is 
15 significant designate the output signals as input signals and iterating a) and b); 

f) if the improvement is not significant to designate the output signals as signals 
decorrelated in a wide sense. 

The apparatus and computer programme aspects of the invention may include features 
equivalent to those mentioned above in connection with the method of the invention. 

20 In order that the invention might be more fully understood, an embodiment thereof will now 
be described, by way of example only, with reference to the accompanying drawings, in 
which:- 

Figure 1 illustrates decomposition of a paraunitary matrix into degree-one paraunitary 
matrices in a prior art convolutive unmixing process; 

25 Figure 2 is a schematic block diagram of decomposition of a paraunitary matrix into 
elementary paraunitary matrices in a convolutive wide sense decorrelation 
process of the invention; 

Figure 3 is a schematic block diagram illustrating production of a paraunitary matrix from 
elementary paraunitary matrices in a process of the invention; 



30 



Figure 4 illustrates production of elementary paraunitary matrices in a process of the 
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invention; 

Figure 5 illustrates decomposition of a paraunitary matrix into elementary paraunitary 

matrices in a prior art wide sense decorrelation process involving more than two 
signal channels; 

5 Figures 6 and 7 schematically illustrate alternative wide sense decorrelation processes of 
the invention involving more than two signal channels; 

Figure 8 illustrates implementation of an alternative form of elementary paraunitary 
matrix involving a leading delay, a rotation and a terminal delay; and 

Figure 9 illustrates use of the Figure 8 elementary paraunitary matrix in an equivalent of 
10 the Figure 4 process. 

The present invention is arrived at by modifying and extending the prior art of both 
instantaneously decorrelating signals and of strongly decorrelating signals: this prior art 
will first be discussed in more detail to enable the invention to be better understood. 

Wide sense decorrelation is based entirely upon second order properties of data. The 
15 equality at (1) below holds if and only if the instantaneous decorrelation property holds for 
the pair of signals x p x 2 . 

E(x l x 1 ) = E(x } )E(x 2 ) (1) 

Here E is the expectation operator, and a product x t x 2 represents the signal obtained by 
multiplying each sample of a signal Xi by a respective sample of a signal x 2 having the 
same sample time: this defines pointwise decorrelation of a pair of signals. 

20 Expectations can be estimated from sampled signals to determine whether or not 
Equation (1) holds for sampled signals. If Equation (1) holds for all signal pairs then the 
signals are referred to as being decorrelated. 

Mathematically, instantaneous mixing of two signals from sources (e.g. loudspeakers) and 
their receipt by two receivers can be represented as follows:- 
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x l (t) = h ll s l (t) + h il s 2 (t) 
x 2 (f)^h 2l s l (f)^h n s 2 (f) 

Here loudspeaker outputs (original signals) reaching the two receivers at time (t) are 
represented by s x (t) and s 2 (t) : t is in units of an interval between successive digital signal 
samples (clock cycles), and has integer values 1 , 2 etc. The path from each loudspeaker 
5 to each receiver and the receiver's reception characteristics give rise to a constant 
. channel modulation. In the instantaneous mixing problem, this modulation is represented 
by multiplier terms /z () . , where i = 1 or 2 indicates first or second receiver and j = 1 or 2 
indicates first or second loudspeaker respectively. First and second loudspeaker signals at 
time (t) are represented by ^(fjand x 2 (t). Each receiver signal is the sum of two 
10 loudspeaker signals, each multiplied by a respective coefficient. These equations can be 
represented more compactly in matrix-vector format as:- 

x(0 = Hs(f) (3) 

where s is a 2 by 1 element vector of two loudspeaker signals, H is a 2 by 2 element 
mixing matrix and x is a 2 by 1 element vector of two received signals, one at each 
15 receiver. H is referred to as the mixing matrix because it has matrix * elements 
(coefficients) that operate on the loudspeaker signals and mix them to form the received 
signals. The sizes of s , x , and H will increase if more than two signals or more than two 
sensors are involved. As previously mentioned, algorithms for solving the instantaneous 
decorrelation problem produced by Equation (3) exist in the prior art. 

20 Mathematically, a correlation matrix R of a set of signals expressed as a vector xcan be 
defined as:- 

R = £((x-E(x))(x-£(x))*) (A) 

where E(x) is the expectation of x, that is the mean of x over time and the superscript 
index H means a Hermitian conjugate. In Equation (4) E(x) is subtracted from x, which is 
25 the first step in all processing that follows. However, to simplify notation it is usually 
assumed that signals have zero mean values, and this assumption will be used in what 
follows. This means that Equation (4) simplifies to R = £(xx*). However, it is not 
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necessary for signals to have zero mean values in order for the invention to be effective. 

To impose decorrelation upon mixed signals, it is necessary to force the correlation matrix 
R to have off-diagonal coefficients which are equal to zero. This is the instantaneous 
decorrelation problem. It can be done, for example, by ensuring that a correct amount of 

5 received signal 1 is subtracted from received signal 2 to ensure that the resulting output is 
decorrelated from received signal 1 . This is repeated for all other signals, producing a set 
of outputs decorrelated with received signal 1 , which is then defined as an output 1 . The 
remaining^outputs are then decorrelated with each other by repeating this procedure for 
them. It is equivalent to obtaining decorrelated signals x' by pre-multiplying signals x by 

10 a lower triangular matrix L , i.e. a matrix having all above diagonal coefficients equal to 
zero. Instantaneously decorrelated signals are designated x' and have a correlation 
matrix R' defined by:- 

R' = E(x'x'* ) = £(Lxx*L" ) = D (5) 

Here the new correlation matrix is shown as D, which is diagonal and positive 
15 semi-definite. 

In order that instantaneous decorrelation algorithms be capable of imposing second order 
independence, a relative propagation delay between a time sample of a signal from a 
signal source reaching two separate sensors must representable as a simple phase shift 
applied to a signal received by one of the sensors. For N sensors this becomes N - 1 
20 phase shifts. If this condition does not hold, channels from signal sources to sensors may 
be convolutive and in consequence instantaneous decorrelation algorithms will fail to 
impose second order independence. 

One-channel convolution and its analysis using z-transforms will now be considered. In a 
single signal, single receiver arrangement, the loudspeaker signal consists of a series of 
25 values indexed by the respective times t at which they were obtained and denoted by s(t) . 
The loudspeaker signal passes through a channel and is received by the receiver that 
detects another series of values, x(t) . The link between s(t) and x(t) may not be a 
simple multiplication as in instantaneous mixing. Instead x(t) may consist of a linear sum 
of several of earlier values of s(t) . This is known as a' convolution, and is shown by:- 
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x(t) = h ® s(t) = £ 7z(*Mr - *) (6) 



where ® is referred to as a "convolution operator". Generation of a linear sum of values of 
s(t) may occur because signals pass through an irregular channel which spreads them 
out. It may also occur from multi-path effects, i.e. when there is more than one path a 
5 signal can take to reach a receiver and the paths have differing propagation times. 
Instantaneous mixing is a simple case of convolutive mixing. 

Convolution is more complex and involves more algebra than simple multiplication by a 
coefficient. However it is possible to reduce it to a multiplication by the use of z-transforms 
as follows. The z-transforms of s,x and h are:- 

s(z) = s(0) + s(l)z~ l + s(2)z~ 2 + *(3)z~ 3 + • • • 
10 x(z) = x(0) + x^z' 1 + x(2)z' 2 + *(3)z- 3 +>- (7) 

h(z) = h(0) + h(l)z" 1 + K2)z~ 2 + ■ • • + h(p)z~ p 

Here z~ l is referred to as a delay operator as mentioned above. The z-transform of a 
signal expressed as a time series of sample values is a polynomial in z -1 . Given the form 
of the z-transform, its polynomial coefficients enable recovery of the original signal. When 
in their z-transform forms, the convolution of s by h as shown in Equation (6) becomes a 
15 multiplication of two polynomials. Thus:- 

x(z) = h(z)s(z) (8) 

Convolutive mixing may be addressed by combining instantaneous mixing as per Equation 
(1) with the more complicated approach of Equations (6) to (8). This deals with a plurality 
of sources and a plurality of receivers with intervening channels which are convolutive. 
20 Thus for the two-signal, two-receiver case the following expressions for received signals 
can be written:- 



x 1 (f) = /z n ®s l (0 + ft l2 ®s 2 (0 
x 2 (t) = li 2l ®s l (t) + h zl ®s 2 (t) 



0) 
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In the matrix-vector notation of Equation (3), and using the z-transform notation of 
Equations (7) and (8), Equation (9) can be written simply as:- 



Here s(z) is a 2 by 1 coefficient vector formed from z-transforms of loudspeaker outputs. 
5 H(z) is a 2 by 2 coefficient polynomial mixing matrix and x(z) is the 2 by 1 vector formed 
from z-transforms of received signals. Again if more than two loudspeakers and/or 
receivers are present, the sizes of s(z) , x(z) and H(z) increase accordingly. 

H(z) is called a polynomial mixing matrix because it represents mixing of signals: its 

coefficients are polynomials in z~ l ." It may also be considered to be a polynomial in z" 1 
10 with coefficients which are matrices. It is found one coefficient at a time by taking z- 
transforms of individual loudspeaker/receiver channels. For future convenience, the order 
and degree of a polynomial matrix will now be defined. The order of a polynomial matrix is 
the. greatest power to which z" 1 is raised in the matrix. The degree of a polynomial matrix 
is the smallest number of delays necessary to implement it as a filter. It is always at least 
15 the same size as the order but may be greater. 



The instantaneous decorrelation problem only deals with removing correlation between the 
signals with no time delays applied to them. In order to impose wide sense decorrelation 
the measure of decorrelation has to operate across time delays. That is, to be strongly 
decorrelated, the correlation between two signals has to be zero when any time delay is 
20 applied to one of the signals. This can be written mathematically as:- 



where We ( 2,-1,0,1,2, •••) means all values of delay d (in units of a time interval 

between successive signal samples) in the set of all positive and negative whole numbers, 
and D(d) is a diagonal matrix (all off-diagonal coefficients zero) for all values of d . It is 
25 possible to put Equation (1 1 ) into z-transform form as follows:- 



x(z) = H(z)s(z) 



(10) 



R(d) = E(x(t)x(t -<**)) = D(d) Vd 6 (• • ~ 2-1,0,1,2, » •) (11) 



R(z) = £(x(z)x(z)) = D(z) 



(12) 
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where D(z) is a diagonal polynomial matrix. Equation (12) introduces the concept of a 
paraconjugate x(z)of a polynomial matrix x(z) denoted by a tilde above the matrix 
symbol x . It means combined operations of transposition and conjugation (as in the 
Hermitian operator) and the substitution of II z* for z . Paraconjugation is an extension of 
the Hermitian operator to polynomial matrices. 

To achieve wide sense decorrelation it is necessary to force all off-diagonal terms of a 
polynomial correlation matrix to become the zero polynomial, i.e. the polynomial in z * 1 
having all its coefficients equal to zero. No constraints are put upon the polynomial 
functions of the matrix on its diagonal. Thus:- 



10 



R'(*) = 



*i(z) 0 
0 <r 2 (z) 



(13) 



The unit polynomial is the polynomial that has zero coefficients for all powers of z 
except for z° which has unity as its coefficient. In the prior art a multichannel lattice filter, 
see e.g. S. Haykin, "Adaptive Filter Theoof\ Prentice Hall, 1991, imposes strong 
15 decorrelation while ensuring that the diagonal terms of the polynomial correlation matrix 
are the same by forcing them all to be the unit polynomial. This whitens the signals. 

The polynomial correlation matrix of the signals x(z) is given by Equation (12) above. A 
multichannel lattice filter transforms these to obtain a matrix of strongly decorrelated and 
whitened signals x'(z) . These signals have the property that their correlation matrix R'(z) 

20 is the identity polynomial matrix I, symbolically: - 

R'(z) = £(x'(z)x'00) = I 04) 

The fact that this method imposes the whitening condition upon the signals means that it 
carries out more calculation than is necessary and has not necessarily used an energy 
preserving transform. It can also lead to difficulties when original signals driving the mixing 
25 were not white, or when mixing nulls some of their frequency components, as it whitens 
the outputs and thus amplifies the noise in those frequencies that were nulled. 
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Instantaneous algorithms that use energy preserving transforms achieve decorrelation by 
the use of unitary matrices. These are energy preserving, and are defined as those 
matrices that when multiplied by their Hermitian conjugate give the identity. The extension 
of the Hermitian operator to the space of polynomial matrices is the operation of 
5 paraconjugation. This leads to the definition of paraunitary matrices as those polynomial 
matrices that give the identity matrix when multiplied by their paraconjugate. Symbolically 

H(z) is a paraunitary matrix if and only if H(z)H(z) = H(z)H(z) = I . 

Paraunitary matrices are energy preserving and, more strongly, are energy preserving at 
all frequencies. This means that if decorrelation is achieved by the use of paraunitary 
10 matrices, the problems of whitening mentioned above are avoided. 

As previously mentioned, the Vaidyanathan reference discloses parameterisation of the 
space of all paraunitary matrices. This gives a stage-by-stage decomposition of a 
paraunitary matrix in z~ l . The parameterisation is illustrated in Figure 1 for a two-channel 
case. It comprises a series of N+1 rotation blocks Q 0 , Qi, ... Qn, adjacent pairs of these 
15 blocks being separated by individual delay blocks A(z) which are all alike. Upper and lower 
signal channels 20 and 22 pass through all the blocks. Amplifiers 24 indicate channel 
input signal scaling factors a, of which the upper channel factor is positive and the lower 
channel factor is positive or negative as indicated by "+-" before "a". 

Each of the rotation blocks Q 0 to Q N implements a respective 2 by 2 unitary matrix which 
20 itself implements a Givens rotation (see US Pat No 4,727,503): a Givens rotation is a 
rotation parameterised by a single rotation angle 0. In the first rotation block Q 0l a signal 
in the lower channel 22 is multiplied by Givens rotation parameters s 0 and c 0 ; a signal in 
the upper channel 20 is multiplied by Givens rotation parameters -s 0 and c 0; s 0 and Co are 
respectively the sine and cosine of the rotation angle 6 implemented by the block Q 0 . 
25 Each c 0 product is summed with the s 0 or -s 0 product from the other channel to provide a 
sum which passes along the respective channel 20 or 22 to the next block A(z). This next 
block A(z) delays the signal in the lower channel 22 with respect to that in the upper 
channel 20. This procedure is then repeated in subsequent rotation block ^ to Q N , with 
equal intervening delays A(z) in the lower channel 22. 



30 The rotation blocks Q 0 to Q N and delays A(z) are themselves paraunitary, and so their 
product is a matrix which is also paraunitary. Vaidyanathan proved that any finite degree N 
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paraunitary matrix can be expressed in this form. Thus apart from a scalar factor, any 
finite degree paraunitary matrix can be expressed as follows:- 

H w U) = Q N .-A(z)Q I A(z)Q 0 (15) 

where N is the degree of the paraunitary matrix. Thus a paraunitary matrix H N (z) of 

5 degree N is the product of N+1 Givens rotations and N one-channel delay operators of 
equal delays, as shown by the concatenated stages in Figure 1 . 

However, there is a difficulty with using Vaidyanathan parameterisation, a simple example 
of which is given here: consider signals which form an independent identically distributed 
(iid) time series. Assume these signals have been mixed by a paraunitary mixing matrix. 
10 Let the paraunitary mixing matrix have degree 1 and be formed, using Vaidyanathan 
parameterisation, as:- 

H„(z) = Q l A(z)Q 0 

_ "1 Oil OTcos(0 o ) sin(0 o )l (16) 
" _° IJL 0 z'l-sin^) cos(0 o )J 

i.e. the last of the two Givens rotations (within the left hand pair of square brackets) is in 
fact the identity. It has been discovered in accordance with the invention that attempts to 
impose strong decorrelation on signals mixed by H N (z) should begin by undoing the 

delay, not by applying a rotation. However, in applying a single block type approach, the 
first step is to try to find an appropriate rotation to apply, even though one is not needed. 
This superfluous rotation is very difficult for later blocks to undo, and its application can 
introduce more correlation between the signals, which is precisely the opposite of the 
required decorrelation. 

The present invention is intended to overcome the difficulties indicated above, and to 
avoid superfluous calculations. It has similarities with Vaidyanathan's parameterisation but 
unlike the latter it employs variable delays not necessarily confined to one channel and not 
necessarily equal; moreover it does not force a rotation to be applied when there is little or 
25 nothing to gain, and allows arbitrarily sized paraunitary matrices to be constructed. 
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Figure 2 illustrates the construction of a paraunitary matrix H N (z) as a product of N+1 

elementary paraunitary matrices in accordance with the invention. An elementary 
paraunitary matrix is an expression coined for the purposes of the present invention: it is 
defined as a polynomial matrix which represents applying a set of delays to the signals, 

5 followed by a single unitary transformation (rotation); as will be described later in more 
detail, this definition may optionally include the rotation being followed by a further set of 
delays (omitted in the present example), in Figure 2 and as in Figure 1, for the sake of 
simplicity, only the two-channel case is shown as indicated by upper and lower channels 
30 and 31 . Each of a set of dotted line blocks Vj(z) (i = 0, 1 , to N) represents a respective 

10 single elementary paraunitary matrix that implements a respective delay dj followed by a 
Givens rotation. N+1 of the blocks V t (z) are used to make up the paraunitary matrix 

H,<*). 

Each of the blocks V 0 (z) to V N (z) implements a respective 2 by 2 elementary paraunitary 
matrix which itself implements a Givens rotation. In the first block V 0 (z), signals in the 
15 upper and lower channels 30 and 31 pass into a matrix of four ganged single pole double 
throw switches 32 controlling routing or otherwise via a delay cell Z* 0 . The switches 32 are 
all either in the UP position or in the DOWN position: when DOWN, upper channel signals 
are delayed at Z d0 but lower channel signals are not, and when UP the reverse is the case. 
This enables either channel 30 or 31 to be delayed relative to the other. 

20 On leaving the switches 32, an upper channel signal is multiplied by Givens rotation 
parameters -s 0 and c 0 at amplifiers 33 and 34 respectively. Similarly, on leaving the 
switches 32, a lower channel signal is multiplied by Givens rotation parameters So and c 0 
at amplifiers 35 and 36 respectively. Here s 0 and Coare respectively the sine and cosine of 
a rotation angle 9 0 implemented by the block V 0 (z). Each c 0 product is summed with the s 0 

25 or -s 0 product involving the signal from the other channel: this provides sums, which pass 
along respective channels 30 and 31 to the next block V^z). This delay/rotate procedure is 
then repeated in subsequent blocks V^zjto V N (z). These later blocks will not be described 
further because they operate in the same way as the first block V 0 (z), except their delays 
and rotations will usually be different. Implementation of this procedure to find appropriate 

30 delays and rotations will be described later, following a theoretical discussion. 

A full paraunitary matrix parameterisation in accordance with the invention in terms of 
elementary paraunitary matrices can be expressed as follows:- 
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H„ 00 = Y dgA (zy . . V diA (z)Y doA (z) (1 7) 

The progress of imposing wide sense decorrelation upon signals in accordance with the 
invention uses a cost function that is calculated after every application of an elementary 
paraunitary matrix to the signals to determine whether or not an improvement has been 
5 achieved. The cost function is a measure of the signals' strong decorrelation, an example 
of which is a sum squared of second order measures of autocorrelation of the signals, and 
is at a maximum when all cross-correlation terms are at a minimum. Thus an increase in 
this cost function corresponds to a reduction in correlation between the signals. The cost 
function will be referred to as the Decorrelation Cost Function (hereinafter "DCF"). 

10 The process of the invention is to find an appropriate paraunitary matrix that imposes wide 
sense decorrelation on the signals. This matrix is not parameterised as disclosed by 
Vaidyanathan, but it is instead parameterised in terms of elementary paraunitary matrices 
(see Equation (17)) calculated sequentially. Once derived, each elementary paraunitary 
matrix is applied to the signals used to calculate it: this transforms the signals into a form 

15 suitable for calculating the next such matrix. 

To calculate an elementary paraunitary matrix a set of possible delays is considered. A 
latest elementary paraunitary matrix is chosen which maximises the DCF calculated for 
signals transformed by application of all such matrices calculated up and including the 
latest. To identify the latest elementary paraunitary matrix its delay and rotation 

20 parameters are required. There is a countable infinity of different possible delays, but an 
uncountable number of different rotation parameters. Realistically the countable infinity of 
possible delays will be reduced to a finite number of different delays to be considered, 
1 either by constraining them or by making the assumption of circulant statistics and hence 
constraining them to be less than the data length (number of samples of each signal used 

25 in decorrelation). 

Once a trial value of delay has been applied to the signals, rotation parameters s 0 and c 0 
are chosen. These parameters could be chosen by using a technique which explicitly 
maximises the DCF of output signals. 

It.is therefore possible to look at a range of trial delays and for each delay to attach the 
30 rotation that maximised the DCF after the application of the delay and the rotation. This 
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produces a set of possible elementary paraunitary matrices, each associated with a 
respective DCF found by applying it to signals and finding the DCF of the resulting output. 
Under certain assumptions it may be possible to avoid applying the elementary 
paraunitary matrices to the signals, and instead to calculate the DCF of the resulting 
5 output directly. This has computational advantages, but is only possible in some cases. 

The elementary paraunitary matrix that is selected is that having the best DCF output. This 
stage is repeated until there is no gain in DCF to be found, or the gain in DCF is regarded 
as being too small to warrant the extra complexity in the strongly decorrelating matrix. 

Thus the invention can be seen as a method of imposing wide sense decorrelation by 
10 sequentially applying a series of elementary paraunitary matrices each chosen to 
maximise an DCF measure in an output resulting from its use. Each elementary 
paraunitary matrix contains one rotation. Thus the process of the invention implements an 
algorithm referred to as the Sequential Best Rotation (SBR) algorithm. This algorithm 
comprises decomposition of the paraunitary matrix into elementary paraunitary matrices, 
15 and the use of a measure which can be meaningfully calculated at each stage of the 
algorithm not just at the end. The combination of these two steps separates the algorithm 
into a series of like stages that can be carried out in sequence. 

Strong decorrelation of signals is equivalent to a process of generating a strongly 
decorrelating matrix H(z) . This process will now be described. It is not essential to 
20 generate H(z) , but it permits the process to be tested: i.e. mixed signals which have 
undergone a prearranged form of convolutive mixing can be used to produce H(z) , and 

then H(z) , the paraconjugate of H(z) , can be compared with the prearranged form to 
see how faithfully the former reconstructs the latter. Moreover, it can be shown that under 
certain circumstances H(z) can be used to determine further information about original 
25 signals, such as arrival direction and frequency response. 

Referring now to Figure 3, a flow diagram is shown illustrating an iterative process of the 
invention to produce a strongly decorrelating polynomial paraunitary matrix R(z) as the 
product of a series of elementary paraunitary matrices. At the beginning of each 
successive iteration of the process there is a^respective current value h(z) of the evolving 
30 paraunitary matrix, i.e. the product of all elementary paraunitary matrices calculated up to 
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the present time. There is also a set of current signals derived using the most recently 
calculated elementary paraunitary matrix incorporated in h(z): h(z) represents the 
transformation which converts the original input mixed signals to obtain the current 
signals: it is improved repeatedly until applying a further elementary paraunitary matrix 
results in no significant improvement in DCF, at which point h(0 has become B(z) . 

The process starts at 60, where the initial current signals are the original input signals, and 
the initial current value of the paraunitary matrix h(z) is the identity matrix I . At 61 the 
current signals are used to produce a current elementary paraunitary matrix; at 62 this 
matrix is applied to the current signals to produce new signals. At 63 the new signals are 
tested to see whether their strong decorrelation properties have improved: if such an 
improvement has been obtained, at 64 the current elementary paraunitary matrix is used 
to pre-multiply h(z) to provide the latest current value of the evolving paraunitary matrix 
and the new signals become the latest current signals. The latest current value of h(z) 
and the latest current signals are then fed back to stage 61 for the procedure of stages 
61 , 62 and 63 to be repeated. 

If at 63 the new signals show no improvement over the current signals, the process is 
considered to have converged to a solution and it terminates at 65: the current value of 
h(z) is output as H(z) and the current signals are output as the required strongly 
decorrelated signals. Thus the use of elementary paraunitary matrices divides the problem 
20 of strongly decorrelating signals and obtaining H(z) into a number of sub-problems. 

Methods for finding each elementary paraunitary matrix at 61 and for deciding at 63 
whether or not the new signals are improvements over the current signals will now be 
described. The second relies upon a measure of strong decorrelation of signals being 
used. An accurate and comprehensive decorrelation cost function (DCF) is impossible to 
25 calculate from sampled signals, but it is possible to calculate various different measures of 
strong decorrelation. Some examples of good decorrelation cost functions will now be 
given. They are based upon the correlation terms r(x v x 2 ) of a set of signals defined by: 



10 



r(x 17 x 2 ) =E[x 1 x 2 ] 
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where E is the expectation operator and x, and x 2 are signals in the set: they may be 
different signals, or the same signals, and they may have delays applied to them. 

Decorrelation cost functions, denoted by Nf 9 are defined as the sum squared of all 
possible correlation terms in a set of signals. In the present example correlations are 
5 derived between each signal in the set and a relatively delayed replica of itself, i.e. there 
are no correlations between different signals: X! and x 2 in Equation (18) are therefore the 
same signal with a relative delay therebetween, and correlations are obtained for each of 
a set of delays up to a maximum delay of L The maximum delay L can be regarded as an 
indication of the locality of the cost function. If L = 0 the cost function is referred to as a 
10 pointwise cost function, if L is small, i.e. L<5, the cost function is referred to as a local cost 
function, otherwise it is a wide cost function. 

Changing the L value changes the properties of N{ t but for all values of L this is a 

sensible decorrelation cost function. In particular, the two cases of L=0 and L=max (i.e. 
the largest value that can be measured using the sampled signals) have been successfully 
15 used. The L=0 DCF leads to an algorithm which imposes spectral majorisation, while the 
L=max DCF leads to an algorithm that only imposes wide sense decorrelation, and not 
spectral majorisation. The process of the invention may use any convenient DCF, 
provided it is an accurate DCF and is used consistently throughout: specific problems may 
suggest specific DCFs. 

20 At 63 current signals and new signals are compared by calculating their DCFs: the signals 
having the higher DCF are accepted as being the better of the two. 

To obtain the latest elementary paraunitary matrix at 61 , it is necessary to derive two 
parameters which characterise such a matrix, a delay parameter d and a rotation 
parameter 0 . Thus identification of the elementary paraunitary matrix reduces to finding 
25 (d,0) . The parameter d is a vector parameter, and 9 can be a vector parameter if there 
are more than two signals. However in this example only the case of two signals is being 
considered, thus 0 is not a vector. 

To find the delay parameter d ; a set D of possible values for it is selected. A suitable set 
for the two-signal case is shown below in Equation (22), i.e. D may be represented as: 
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where clots indicate terms not shown, each inner pair of parentheses represents a delay 
vector 8 f having two elements, 0 in an upper or lower position in parentheses indicates an 

undelayed upper or lower signal channel respectively, and a non-zero value in such a 
5 position indicates a delayed channel located likewise: each non-zero value represents 
delaying the associated channel in each case relative to the other channel by an integer 
value not greater than A Delays are in units of a time interval t between successive signal 
samples, e.g. a delay (A - 1) implies (A - 1)t. There are also other possibilities for D , but 
before major departures from this scheme are implemented they should be assessed for 
10 validity. The number of elements or delay values in D is S , i.e. \d\ = S . 

Referring now to Figure 4,. there is shown a flow diagram indicating how elementary 
paraunitary matrices can be found. For all delay vectors S^o 5 S in D, respective 

elementary paraunitary matrices are derived, and then the DCFs of the signals these 
matrices create are calculated. The elementary paraunitary matrix corresponding to the 

15 best DCF is then selected for use. In the drawing, in the two-channel case a pair of current 
signals are input at 70. At 71 the signals are replicated S times, i.e. as many times as 
there are delay values inD. In the two channel case each replica is a pair of current 
signals: each replica pair is input to a respective signal channel 72j of which there are S , 
i.e. i = 1 to S . Three channels are shown, 72), 72 2 and 72 s , others being implied by chain 

20 lines such as 73. At 74t to 74 5 , each of the channels 72i to 72 s applies a respective 
delay vector 5 t , ....5 S from D to its replica, i.e. it delays one of the pair of current signals 
in its replica relative to the other: the ith channel 72j applies delay vector 8, to the ith 
replica, and i = 1 to S . In each of the channels 72i to 12 s the relative delay produces a 
pair of signals that are different from those input at 70, although they are no more strongly 

25 decorrelated than before. 

To make these signals as decorrelated as possible in a pointwise sense using only a 
rotation can be achieved by using an instantaneous algorithm. A review of many methods 
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of imposing instantaneous decorrelation, both with and without using energy preserving 
techniques, appears in "Matrix Computations", John Hopkins University Press 1989 by 
G.H. Golub and C.F. Van Loan. This covers a wide range of techniques developed over 
the last 150 years, and discusses numerical issues caused by the introduction of 
5 computers into the field. 

Suitable rotations may be calculated by one of several different techniques. For many 
choices of DCF it may be possible to calculate the rotation that explicitly maximises the 
DFC of the signals afer the application of the rotation. It may also be possible to try to find 
this rotation by a numerical approximation method. A third possibility is to use a pointwise 
10 decorrelation algorithm to find the rotation that imposes pointwise decorrelation; this 
should lead to an improvement in DCF to at least a predominant part of the maximum 
improvement attainable in this regard. These three rotation finding possibilities may 
coincide, as they do for the measure with L=0. 

Each channel 72) now has parameters associated with it consisting of a delay vector 5, 

15 and a rotation angle 0„ just as if an elementary paraunitary matrix implementing those 
parameters had been applied to the signals input at 70: moreover, when applied to input 
signals this matrix provides a rotation which produces output signals that are as 
decorrelated as they can be for the relevant value of delay element 8, . The first two 

elements 74j and 75,- of each channel 72\ therefore in combination simulate an elementary 
20 paraunitary matrix with a single set of delays and a respective parameter set 

(d,0 s (B fl £O- 

The next stage is to determine which of the channels 72i to 72 5 simulates an elementary 
paraunitary matrix producing the highest improvement in strong decorrelation. This is 
carried out at 76i to 16 Si where DCFs of signals output from stages 75 t to 75 5 

25 respectively are calculated. These DCFs are compared with one another at 77, and 
whichever channel produces the greatest DCF provides the highest improvement in strong 
decorrelation and simulates the best elementary paraunitary matrix: this matrix is selected 
as the output of the process indicated in Figure 4 for use at 61 in Figure 3. 

' On making certain assumptions regarding the input signals it is possible to avoid most of 
30 the calculations in this stage. It may be possible to predict the DCFs of signal outputs 
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from stages 75i to 75 s . If this is possible then it is not necessary to carry out the 

calculations for any of the channels except the one that will lead to the greatest increase in 
DCF. In general if this approach is valid, it is worth doing as the computational savings 
will be considerable. An example of this technique is now given. If the cost function is 
5 with L=0 then the maximum possible gain in DCF achievable for a given delay is simply 
the correlation between the two signals after that delay has been applied. By using the 
• circulant assumption, this correlation can be calculated for all possible delays by using 
Fourier transforms. This is a prior art technique for calculating the cross correlation 
between two signals, and is computationally inexpensive compared with calculating the 
10 cross correlations for each delay individually. Thus the delay which leads to the maximum 
cross correlation between the two signals can be selected, without having to apply delays 
and calculate rotations for each of the channels. 

It is not in fact essential to choose whichever channel produces the greatest DCF: another 
channel giving a value relatively close to this value would be acceptable. Also a channel 

15 might be selected to give a more convenient value for one of the parameters, e.g. a 
shorter delay. If however a channel is chosen giving a value insufficiently close to the 
greatest DCF, then the process of signal separation may fail because it might be 
impossible to rectify matters in subsequent iterations. So a channel should be selected 
which gives a DCF or improvement in decorrelation to at least a predominant part of the 

20 maximum in this regard obtainable over the ranges of delay and rotation parameters 
employed. 

The foregoing embodiment of the invention related to imposing wide sense decorrelation 
in a two-channel case, i.e. two signal sources, two receivers and two paths from each 
source to receivers. If there are more than two channels further processing is necessary. 
25 However the procedure of the invention remains fundamentally the same: i.e. find a 
paraunitary matrix to impose strong decorrelation on signals in the. channels, the matrix 
being a product of successive elementary paraunitary matrices each of which maximises a 
measure of strong decorrelation of outputs resulting from its application to input signals. 

There are several ways of extending the invention to more than two channels. One is a 
30 simple, direct extension of the previous embodiment. Others use signal pairs, attempt to 
impose pairwise independence, and sweep through all signal pairs in some order. 
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Figure 5 shows the decomposition of a paraunitary matrix with degree N for the multi- 
channel case (i.e. three or more channels) as given by the Vaidyanathan prior art 
previously mentioned. It is very similar to the two-channel case of Figure 1 . The difference 
is that Givens rotations, which are essentially two-channel operators, have been replaced 
5 with more general rotations operating on all channels at once. This can be expressed 
mathematically as:- 

H„(z) = R„---A(*)R I A(z)R 0 (20) 

where as before A(z) is a matrix representing successive unit delays in the lowermost 
channel, and R / (i = 0 to N) represents the ith rotation. In the n channel case each 
10 rotation has n(n-1 )/2 parameters, which reduces to one parameter when n = 2 for Givens 
rotations used in the two-channel case. 

The parameterisation of Equation (20) involves the same problems as in the two-channel 
case described with reference to Figure 1 . Here again the procedure used to deal with the 
problems is the decomposition of a paraunitary matrix into elementary paraunitary 

15 matrices: the elementary paraunitary matrices are found by considering the decomposition 
and constraining all but one rotation. One rotation is unconstrained, but all others are 
constrained to either leave inputs unchanged or to reorder channels so a different channel 
rece j ves a subsequent delay. Thus all rotations apart from one are constrained to be 
permutations. In the present example of an elementary paraunitary matrix with only one 

20 set of delays, the unconstrained rotation is the last rotation, and the matrix can thus be 
expressed as:- 

V d ,e(0 = R e A(2)-P 1 A(z)P 0 =R e D d (z) (21) 

Here P f is the ith permutation matrix and R e is the rotation matrix parameterised by a 
vector of n(n-1)/2 rotation angles 8. The combination of all permutation matrices and 
25 delay matrices is denoted by D d , which is referred to as the delay matrix. Its index, d , is a 
length n vector whose n elements are delays applied in respective channels. 

Equation (21) shows the extended form of an elementary paraunitary matrix. First a delay 
matrix is applied, which consists of differing delays being applied in all the channels. 
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Secondly a rotation is applied. As before this is a good choice for an elementary building 
block, as it contains only one operation that mixes the signals. However, this form is 
parameterisable, the parameters being the vectors d and 0. This means that the 
decomposition of a paraunitary matrix into elementary paraunitary matrices can be written 
5 as:- 

H(z) = V dN , 9w ( Z )-V diifli (z)V do , eo ( 2 ) (22) 

The methodology for finding this expansion is largely the same as in the two-channel 
case. It only requires modification in selecting D , i.e. the delay set or set of possible 
delays for forming the first of the series of elementary paraunitary matrices. In the two- 
10 channel case the elements of D were treated as delays applied to only one channel. In 
the more general case delays are allowed in more than one channel. As in the two- 
channel case there is a countable infinity of possibilities, so a sensible finite set has to be 
selected to form D . 

The number of elements in D could be fixed by setting an upper limit denoted by I to the 
15 number of delays in it. Thus D can consist of all possible ways of allocating up to I 
delays between n channels. Applying the same delay to all channels at the same time is 
equivalent to applying no delay at all, so all elements in D should have at least one 
channel that is not delayed. The number of elements in D can grow very fast as n and / 
increase. As before this scheme for D is merely selection of a sensible option. Other 
20 possibilities exist but usually need more justification to ensure validity. It is possible to 
justify some choices (such as allowing delays to be applied to only one channel) by saying 
they allow a higher / for a fixed size of D . Thus they allow the algorithm to consider 
longer delays, at the cost of not allowing it to implement more complicated combinations 
of smaller delays. 

25 However it is defined, once D is fixed the process of the invention can proceed to impose 
wide sense decorrelation in the multi-channel case in a similar way to the two-channel 
case of the previous embodiment. There are two differences: firstly, the two-element delay 
vectors 8 f in Figure 4 are replaced by delay vectors (elements of the delay set D) having 
n elements, where n is the number of channels. This allows the algorithm to apply an 

30 element of D to the signals. Secondly, the rotations calculated by known pointwise 
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decorrelation algorithms at stages 75i to 75 s are now n channel rotations. These can be 
obtained by successively applying n(n-1)/2 two channel rotations, and so can be 
parameterised by n(n -1)/2 angle parameters: Prior art pointwise decorrelation algorithms 
previously mentioned provide current signals and a rotation matrix which generates them, 
5 so the process will not be described further. 

This approach to the multi-channel case may give difficulty over the size of the delay set 
D : because each element in D is a vector, for each element a whole n by n pointwise 
decorrelation algorithm has to be carried out. As the delay set upper limit / grows this 
becomes the dominant time constraint and slows processing down. So-called "sweeping" 
10 algorithms were developed in the prior art to attempt to mitigate similar problems in the 
instantaneous case. 

The principle of using energy preserving transforms to zero the correlation between a set 
of signals has a long history. The first reference is "Uber ein Leichtes Verfahren Die in der 
Theorie der Sacularstroungen Vorkommendern Gleichungen Numerisch Aufzulosen" 
15 Crelle's J. 30 p51-94, 1846 by C.G.J. Jacobi. He introduced the idea of tackling the 
multichannel case by sweeping through pairs of signals in such a way as to strictly 
decrease the total measure of correlation between different signals in the problem at each 
stage. This technique is known as using Jacobi sweeps 

It has been shown that cycling through different signal pairings and imposing pairwise 
20 decorrelation leads to good decorrelation of several signals, provided that there are ' 
repeated sweeps through all available signal pairs. The technique for decorrelating several 
signals by repeated sweeps through all available signal pairs imposing pairwise 
decorrelation on each pair is known as the Jacobi algorithm. At any one stage it only 
works on making two signals pairwise decorrelated. 

25 This example of the invention provides a procedure called the sweeping process for 
finding an elementary paraunitary matrix based algorithm that follows the same approach 
used in the Jacobi algorithm. The procedure has a prearranged ordering of all available 
signal pairings through which it works in a sweep. Each signal pair is processed using the 
two-channel process of the first embodiment of the invention. The remaining signals are 

30 aligned in time (delayed) so they have at least approximately the same delay applied to 
them as the signal pair being processed. At the end of a sweep the same termination 
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condition from the previous embodiment of the invention can be applied to decide if 
another sweep needs to be carried out or if no more significant improvement is obtainable. 

Figure 6 illustrates this procedure for imposing wide sense decorrelation on three signals 
A, B and C. As mentioned earlier the process of the invention implements the Sequential 
5 Best Rotation or SBR algorithm. In the drawing, "SBR2" indicates application of the SBR 
algorithm for a two-channel case as in the previous embodiment. 

In an initial phase, signals A and B are processed with SBR2 at 90, and at 91 signal C is 
aligned in time for substantial synchronisation with the output from 90. In this example the 
elementary paraunitary matrices implement only one set of delays, and time alignment at 

10 91 is carried out by delaying C by half the order of the polynomial matrix determined at 90. 
In other words C is delayed in time by half the sum total of the delays applied in the 
processing of A and B irrespective of which channel the or (as the case may be) each 
delay lies in. Alignment of C in this way is not essential but has been found to give 
acceptable results. It is not necessary to allow for processing delay to implement SBR2 at 

15 90, because the signals A, B and C are sequences of digital data samples stored when 
not in use: "delaying" C relative to A or B merely means changing the sample number in a 
sequence input to a subsequent processing stage. 

At stages 90 and 91 in combination, at least one elementary paraunitary matrix is applied 
to all three signals: such a matrix can be generated by extending the or as the case may 
20 be each two channel elementary paraunitary matrix produced at 90 to cover three 
channels. The or each matrix applies rotation and delay to signals A and B but it applies a 
delay to signal C without rotating it. If more than one elementary paraunitary matrix is 
determined at 90, all delays applied to signal C must add up to the total delay applied to C 
at 91. 

25 In a second phase, signal B (after processing at 90) and signal C (after delay at 91 ) are 
processed with SBR2 at 92, and signal A (after processing at 90) is aligned in time at 93 
with the output from 92. In a third phase, signal A (after alignment at 93) and signal C 
(after processing at 92) are processed with SBR2 at 94, and signal B (after processing at 
92) is aligned in time at 95 with the output from 94. 

30 The next stage at 96 is to determine whether or not the signals have become sufficiently 
strongly decorrelated to warrant terminating the process, or whether it is necessary to 
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continue: as before this termination condition is satisfied if DCFs of the signals have not 
improved significantly: if so, processing terminates; if not a respective elementary 
paraunitary matrix determined at each of 90, 92 and 94 is used to pre-multiply a stored 
product of those preceding it and the latest resulting product replaces that previously 
5 stored; the latest current signals from SBR2 94 and alignment 95 are then fed back to 90 
and 91 via a feedback loop 97 for the procedure to be repeated. 

In this embodiment, one or more two-channel elementary paraunitary matrices may be 
applied in the SBR2 stages at 90, 92 and 94 for each circulation round the loop of items 
90 to 97. It is possible to apply as many elementary paraunitary matrices at these stages 
10 as one wishes, up to the number necessary to meet the termination condition for the 
SBR2 algorithm. It is preferred to apply only one elementary paraunitary matrix at the 
stages 90, 92 and 94 because it reduces the need for signal alignment at 91 , 93 and 95: it 
uses the philosophy of applying the more important rotations first, because earlier 
rotations in the SBR2 process tend to give more performance gain than later equivalents. 

15 If, rather than operating on signal pairs in a fixed order, the pair that leads to the greatest 
DCF increase is operated on first, another approach for imposing wide sense 
decorrelation can be produced. This approach also operates on pairs, but pair ordering is 
not fixed. Instead all possible pairs are considered and the one that leads to the greatest 
improvement in DCF is accepted. This is known as a searching process, as it searches 

20 through possible signal pairs. Figure 7 illustrates the procedure for three signals A, B and 
C. The signals are replicated n(n - 1)/2 times where n is the number of signals: n = 3 in 
the present example, which coincidentally means there are three replicas of the signals. In 
an initial phase, using a first replica of the three signals A, B and C, at 100 signals A and B 
are processed with SBR2, and, in this example using elementary paraunitary matrices with 

25 one set of delays, signal C is aligned in time with the output so produced. Here again a (or 
as the case may be each) two channel elementary paraunitary matrix produced by the 
application of SBR2 to two signal channels is extended to an n channel elementary 
paraunitary matrix that delays (but does not rotate) the or each, remaining signal C to 
preserve time alignment with SBR2-processed signals A and B. 

30 For a second replica, at 101 signals A and C are processed with SBR2, and signal B is 
. aligned with the output. For a third replica, at 102 signals B and C are processed with 
SBR2 at 94, and signal A is aligned with the output. At 103, the DCFs of the three output 
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signal groups are determined: the group with the greatest DCF is selected, and if the DCF 
shows an improvement over the input signals to this stage, then the operation is accepted. 

As before, if the DCF of all the signals has been improved, processing continues with new 
current signals and new current matrix being fed back via a loop 104 to 100, 101 and 102. 
5 If there has been no significant improvement, processing terminates and at 105 the 
current signals and paraunitary matrix are output. 

Again the SBR2 steps 100, 101 and 102 may consist of applying a full SBR2 process of 
calculating a series of, elementary paraunitary matrices until no improvement is obtained, 
or just finding one elementary paraunitary matrix. The latter case is equivalent to a 
10 restricted version of a full SBFt: i.e. ignoring time alignment, the delays are restricted to 
only delaying one channel and the rotations to only applying a rotation to the delayed 
channel and one other. This reduces the size of the delay set and hence increases the 
speed of computation of elementary paraunitary matrices. 

Although the elementary paraunitary matrices described with reference to Figure 2 are 
15 sufficient for implementing the invention, practical applications of the invention may work 
better if an extended definition of elementary paraunitary matrices is used. A more general 
definition of an elementary paraunitary matrix is a polynomial matrix which represents 
applying an initial set of delays to input signals, then applying a single unitary 
transformation (rotation) to the delayed input signals, followed by applying a further set of 
20 delays to the delayed and rotated input signals. The initial set of delays is defined as the 
leading delays and the further set of delays as the terminal delays. 

Figure 8 illustrates a single elementary paraunitary matrix implementation 120 of this new 
form: as in Figures 2 and 3 this implementation is shown for a two channel case for the 
sake of simplicity. In the implementation 120, signals in upper and lower channels 121 and 
25 122 pass into a matrix of four ganged single pole double throw switches 123 controlling 
routing or otherwise' via a delay cell Z*. The switches 123 are all either in the UP position 
or in the DOWN position: when DOWN, upper channel signals are delayed at Z* but lower 

channel signals are not, and when UP the reverse is the case. This enables either channel 

i 

121 or 122 to be delayed relative to the other. 

30 On leaving the switches 123, an upper channel signal is multiplied by Givens rotation 
parameters -Sj and q at amplifiers 124 and 125 respectively. Similarly, on leaving the 
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switches 123, a lower channel signal is multiplied by Givens rotation parameters S\ and Ci 
at amplifiers 126 and 127 respectively. Here Sj and c,are respectively the sine and cosine 
of a rotation angle 9j implemented at 120. Each q product is summed with the Sj or -si 
product involving the signal from the other channel: this provides sums which pass along 
5 respective channels 128 and 129 to another matrix of four ganged single pole double 
throw switches 130 controlling routing or otherwise via a delay cell Z AI . These operate in a 
similar way to the switches 123. 

A paraunitary matrix can be constructed by replacing each of the dotted line blocks V 0 (z) 
to V N (z) in Figure 2 by a respective implementation 120 of the new elementary paraunitary 
10 matrix: each of these blocks 120 or Vi(z) (i = 0, 1, to N) represents a respective single 
elementary paraunitary matrix that implements a respective leading delay dj, Givens 
rotation through an angle 3 and terminal delay Aj. 

A full paraunitary matrix parameterisation in accordance with the invention in terms of the 
new elementary paraunitary matrices can be expressed as follows:- 

15 H„ (*) = V dfo6 ^ N (*)• • • V^ iA (z)V Wo (Z) (23) 

Although the full specification of an elementary paraunitary matrix includes an arbitrary 
terminal set of delays parameterised by Aj, the invention works best if one of two specific 
types of elementary paraunitary matrix are used. Type 1 elementary paraunitary matrices, 
20 which were described with reference to Figure 2, have no (or zero) terminal delays, i.e. 
A. = 0 . Type 2 elementary paraunitary matrices have terminal delays which apply an 
inverse of respective leading delays, i.e. A f = -d, . It will be recalled that a negative 
delay in one channel is implemented by a positive delay in the or each (as the case may 
be) other channel. 

25 The process of the invention using the new or type 2 elementary paraunitary matrices, to 
find an appropriate paraunitary matrix that imposes wide sense decorrelation on the 
signals is very similar to that previously described. The appropriate paraunitary matrix is 
not parameterised as disclosed by Vaidyanathan, but it is instead parameterised in terms 
of elementary paraunitary matrices (see Equation (23)) calculated sequentially. Once 

30 derived, each elementary paraunitary matrix is applied to the signals used to calculate it: 
this transforms the signals into a form suitable for calculating the next such matrix. 
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A delay pair is defined as a pair of delays, one the leading delay and the other the terminal 
delay, used in an elementary paraunitary matrix. To calculate an elementary paraunitary 
matrix a set of possible delay pairs is considered. A latest elementary paraunitary matrix is 
chosen which maximises the DCF calculated for signals transformed by application of all 

5 such matrices calculated up and including the latest. To identify the latest elementary 
paraunitary matrix its delay and rotation parameters are required. There is a countable 
infinity of different possible delays, but an uncountable number of different rotation 
parameters. Realistically the countable infinity of possible delays will be reduced to a finite 
number of different delays to be considered, either by constraining them or by making the 

10 assumption of circulant statistics and hence constraining them to be less than the data 
length (number of samples of each signal used in decorrelation). If type 1 or type 2 
elementary paraunitary matrices (as defined above) are to be used the terminal delays are 
fixed by the value of the leading delays. 

Once a trial value of leading delay has been applied to the signals, rotation parameters Sj 
15 and q are chosen. These parameters could be chosen by using a technique which 
explicitly maximises the DCF of output signals. 

It is therefore possible to look at a set of trial delay pairs and for each delay pair to attach 
the rotation that maximised the DCF after the application of the leading delay, the rotation 
and the terminal delay. This produces a set of possible elementary paraunitary matrices, 
20 each associated with a respective DCF found by applying it to signals and finding the DCF 
of the resulting output. Under certain assumptions it may be possible to avoid applying the 
elementary paraunitary matrices to the signals, and instead to calculate the DCF of the 
resulting output directly. This has computational advantages, but is only possible in some 
cases. 

25 The elementary paraunitary matrix that is selected is that having the best DCF output. This 
stage is repeated until there is no gain in DCF to be found, or the gain in DCF is regarded 
as being too small to warrant the extra complexity in the strongly decorrelating matrix. 

The iterative process for calculating a strongly decorrelating paraunitary matrix H(z) as 
shown in Figure 3 remains nearly the same for the new or type 2 elementary paraunitary 
30 matrices. The only difference is that to obtain the latest elementary paraunitary matrix at 
61, it is necessary to derive parameters which characterise such a matrix, delay 
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parameters d and A and a rotation parameter 0 . Thus identification of the elementary 
paraunitary matrix reduces to finding (d,8,A). The parameters d and A are vector 
parameters, and 0 can be a vector parameter if there are more than two signals. In the 
case of type 2 elementary paraunitary matrices, A is entirely dependent upon d , and so 
5 once d has been found (as described earlier) A has also been found. 

The additional parameter A results in a small change as compared to the process 
described with reference to Figure 4. This change is illustrated in Figure 9, which shows a 
processing channel 140| (i = 1 to S) to replace channel 72\ in Figure 4. Ignoring the fact 
the delay at 74 has been called the leading delay at 142,-, the only difference between the 
10 channels 72\ and 142| is that the latter includes provision for application of a terminal 
delay Aj at 144 { after rotation at 146j and before DCF calculation at 148]. 

The equations and procedures given in the foregoing description can clearly, be 
implemented by an appropriate computer program comprising program instructions on an 
appropriate carrier medium and running on a conventional computer system. The carrier 
15 medium may be a memory, a floppy or compact or optical disc or other hardware recordal 
medium, or an electrical signal. Such a program is straightforward for a skilled 
programmer to implement from the foregoing description without requiring invention, 
because it involves well known computational procedures 



